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1. In a vector space representing the latent semantic content of a collection of 
documents, a method for discerning the presence of at least one sense of a subject term, 
the method comprising: 

determining at least one cluster of documents within the vector space, each cluster 
corresponding to a subset of documents within the vector space containing 
a subject term, and 

determining an implicit position within the vector space of at least one sense of 
the subject term, the implicit position corresponding to at least one 
determined cluster. 

2. The method in accordance with Claim 1, wherein the vector space is a latent 
semantic indexed vector space. 

3. In a collection of documents, each document containing a plurality of terms, a 
method for discerning the presence of at least one sense of a subject term, the method 
comprising: 

forming an m by n matrix, where each matrix element (/, j) corresponds to the 

number of occurrences of term i in document j; 
performing singular value decomposition and dimensionality reduction on the 

matrix to form a latent semantic indexed vector space; 
determining at least one cluster of documents within the vector space, each cluster 

corresponding to a subset of documents within the vector space containing 

a subject term; and 

determining an implicit position within the vector space of at least one sense of 
the subject term, the implicit position corresponding to at least one 
determined cluster. 

4. In a collection of n documents and a reference collection, each document 
containing terms, the reference collection containing at least one meaning associated with 
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a term, the total number of terms occurring at least once in the document collection equal 
to at least m, a method for determining a meaning for a sense of a subject term, the 
subject term found in at least one document and associated with at least one meaning, the 
method comprising: 

forming an m by n matrix, where each matrix element (/, j) corresponds to the 

number of occurrences of term i in document j; 
performing singular value decomposition and dimensionality reduction on the 

matrix to form a latent semantic indexed vector space; 
determining at least one cluster of documents within the vector space, each cluster 

corresponding to a subset of the document collection, each member of the 

subset having at least one occurrence of a subject term; 
discerning an implicit position of a sense of the subject term, each implicit 

position corresponding to at least one determined cluster; 
discerning at least one non-subject term within the vicinity of the implicit position 

of the sense; and 

assigning to the sense having a discerned implicit position, the meaning, 
associated with the term in the reference collection, that correlates best 
with the discerned non-subject terms closest to the implicit position of the 
sense. 

5. In a vector space representing the latent semantic content of a collection of 
documents, and in a reference collection comprising at least one meaning associated with 
a term, a method for determining a meaning for a sense of a subject term, the subject term 
found in at least one document and associated with at least one meaning, the method 
comprising: 

determining at least one cluster of documents within the vector space, each cluster 
corresponding to a subset of the document collection, each member of the 
subset having at least one occurrence of a subject term; 
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discerning an implicit position of a sense of the subject term, each implicit 
position corresponding to at least one determined cluster; 

discerning at least one non- subject term within the vicinity of the implicit position 
of the sense; and 

assigning to the sense having a discerned implicit position, the meaning, 
associated with the term in the reference collection, that correlates best 
with the discerned non-subject terms closest to the implicit position of the 
sense. 

6. The method in accordance with Claim 5, wherein the vector space is a latent 
semantic indexed vector space. 

7. In a collection of n documents and a reference collection, each document 
containing terms, the reference collection containing at least one meaning associated with 
a term, the total number of terms occurring at least once in the document collection equal 
to at least m, a method for determining a meaning for an occurrence of a subject term, the 
subject term found in at least one document and associated with at least one meaning, the 
method comprising: 

forming an m by n matrix, where each matrix element (i, j) corresponds to the 

number of occurrences of term / in document j\ 
performing singular value decomposition and dimensionality reduction on the 

matrix to form a latent semantic indexed vector space; 
discerning the position, within the vector space, of an occurrence of a subject 

term; and 

assigning to the occurrence, the meaning, associated with the subject term in the 
reference collection, that correlates best with non-subject terms closest to 
the implicit position. 

8. In a vector space representing the latent semantic content of a collection of 
documents, and in a reference collection comprising at least one meaning associated with 
a term, a method for determining a meaning for a occurrence of a subject term, the 
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4 subject term found in at least one document and associated with at least one meaning, the 

5 method comprising: 

6 determining at least one cluster of documents within the vector space, each cluster 

7 corresponding to a subset of the document collection, each member of the 

8 subset having at least one occurrence of a subject term; 

9 discerning an implicit position of a sense of the subject term, each implicit 

10 position corresponding to at least one determined cluster; 

1 1 discerning at least one non- subject term within the vicinity of the implicit position 

12 of the sense; and 

O 13 assigning to the sense having a discerned implicit position, the meaning, 

, n 14 associated with the term in the reference collection, that correlates best 

™ 15 with the discerned non-subject terms closest to the implicit position of the 

U\ . .~ . 

'^j 16 sense. 

ru 

p 1 9. The method in accordance with Claim 8, wherein the vector space is a latent 

j| 2 semantic indexed vector space. 

|;Q 1 10. In a collection of n source documents and a collection of x reference documents, 

^ 2 each document containing terms, each reference document containing at least one 

□ 3 meaning associated with a term, the total number of terms occurring at least once in the 

4 combination collections equal to at least m, a method for determining a meaning for a 

5 sense of a subject term, the subject term found in at least one source document and 

6 associated with at least one meaning, the method comprising: 

7 forming an m by [n + x] matrix, where each matrix element (/, j) corresponds to 

8 the number of occurrences of term i in document j; 

9 performing singular value decomposition and dimensionality reduction on the 

10 matrix to form a latent semantic indexed vector space; 

1 1 determining at least one cluster of documents within the vector space, each cluster 

12 corresponding to a subset of the [n + x] documents having at least one 

13 occurrence of a subject term; 
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discerning the implicit position of at least one sense of the subject term 
corresponding to at least one determined cluster; and 

assigning to at least one sense corresponding to at least one discerned implicit 
position, the meaning of the subject term closest within the vector space to 
the implicit position of the sense. 

11. The method as described in Claim 10 wherein each document in the reference 
source corresponds to one meaning. 

12. In a collection of n source documents and a collection of x reference documents, 
each document containing terms, each reference document containing at least one 
meaning associated with a term, the total number of terms occurring at least once in the 
combination collections equal to at least m, a method for determining a meaning for an 
occurrence of a subject term, the subject term found in at least one source document and 
associated with at least one meaning, the method comprising: 

forming an m by [n + x] matrix, where each matrix element (/, j) corresponds to 

the number of occurrences of term i in document j; 
performing singular value decomposition and dimensionality reduction on the 

matrix to form a latent semantic indexed vector space; 
discerning the position, within the vector space, of an occurrence of a subject 

term; and 

assigning to the occurrence, the meaning, associated with the subject term, closest 
to the implicit position of the sense. 

13. The method as described in Claim 12 wherein each document in the reference 
source corresponds to one meaning. 

14. In a collection of documents, each document containing a plurality of terms, a 
computer-implemented method for discerning the presence of at least one sense of a 
subject term, the method comprising: 
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4 forming an m by n matrix, where each matrix element (/, j) corresponds to the 

5 number of occurrences of term i in document j\ 

6 performing singular value decomposition and dimensionality reduction on the 

7 matrix to form a latent semantic indexed vector space; 

8 determining at least one cluster of documents within the vector space, each cluster 

9 corresponding to a subset of documents within the vector space containing 

10 a subject term; and 

1 1 determining an implicit position within the vector space of at least one sense of 

12 the subject term, the implicit position corresponding to at least one 
/^/y^ determined cluster. 

^tl 15( In a collection of documents, each document confining a plurality of terms, a 

2 oomputer program product for discerning the presence of at least one sense of a subject 

3 term when executed on a computer system, the computer program product comprising: 

4 a computer-readable medium; / 

5 a matrix-forming module stored on the medium that forms an m by n matrix, 

6 where each matrix element (/,/) corresponds to the number of occurrences 

7 of term / in document j; / 

8 a singular value decomposition and dimensionality reduction module stored on 

9 the medium and couple to tfte matrix forming module that forms a latent 

10 semantic indexed vector space from the matrix; 

1 1 a clustering module stored on ine medium that determines at least one cluster of 

12 documents within tly vector space, each cluster corresponding to a subset 

13 of documents withm the vector space containing a subject term; and 

14 a sense position determining module stored on the medium an implicit position 

15 within the vector space of at least one sense of the subject term, the 

16 implicit position corresponding to at least one determined cluster. 
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1 16. In a collection of documents, each document containing a plurality of terms, a 

2 computer program product comprising instructions that when executed perform the 

3 method comprising the steps of : 

4 forming an m by n matrix, where each matrix element (/, j) corresponds to the 

5 number of occurrences of term / in document j; 

6 performing singular value decomposition and dimensionality reduction on the 

7 matrix to form a latent semantic indexed vector space; 

8 determining at least one cluster of documents within the vector space, each cluster 

9 corresponding to a subset of documents within the vector space containing 

10 a subject term; and 

1 1 determining an implicit position within the vector space of at least one sense of 

12 the subject term, the implicit position corresponding to at least one 

13 determined cluster. 
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